Systems and methods for using metadata to search for related computer infrastructure components

ABSTRACT

A computer system for crawling computer infrastructure inventory data includes a processor and a memory device coupled to the processor. The computer system also includes an inventory database system stored on the memory device. The inventory database system includes computer-executable instructions allowing the computer to manage stored records. The computer system is configured to (a) retrieve an inventory record from the inventory database system, the inventory record containing inventory data, (b) determine that the inventory data contains relational inventory metadata, (c) determine that the relational inventory metadata indicates at least one related inventory record, and (d) perform step (a) on the at least one related inventory record.

BACKGROUND OF THE INVENTION

The field of the invention relates generally to management of computerdevices and, more particularly, to network-based systems and methods forscanning computer devices to collect inventory data including hardwareand software objects associated with the computer devices.

At least some known computing infrastructures for medium to largebusiness entities include hundreds or thousands of computer devices,from mission-critical servers hosting production applications topersonal computers (PCs) used by support personnel. Managing suchcomputing infrastructures requires knowledge of the types of hardwareand software objects deployed across the environment. Asset managersoftentimes need to maintain inventory information on the computingdevices in service, and need to track devices as they enter and exitservice. System administrators may need to track operating systemversions and patch levels for the computing devices, as well as whichapplications and versions are installed on each computing device.

One known method of maintaining such inventory information is the manualtracking and recording of information in a spreadsheet or a database.Tracking asset information manually can be significantly time consuming,and is subject to human error. Further, changes in the infrastructurelead to stale data that may not be immediately corrected, and may gounrealized for long periods of time. Other known methods of maintaininginventory information utilize inventory management applications. Suchapproaches often require purchasing a management product, and installinga pre-compiled agent on each computing device in the infrastructure.These agents may not be available for all operating systems present inan infrastructure. When these agents are installed, these known agentsare oftentimes not customizable enough for certain tasks, and typicallyimpact the computational capacity of the hosts on which they run.

Accordingly, it is desirable to have a tool set that can be deployedonto each computing device such that each computing device can thenexecute a common scan program to collect similar inventory data, withminimal computational overhead on the executing host.

BRIEF DESCRIPTION OF THE INVENTION

In one aspect, a computer-implemented method for crawling computerinfrastructure inventory data is provided. The method is implemented bya computing device coupled to a memory device. The method includes (a)using the computing device to retrieve an inventory record from aninventory database system, the inventory record containing inventorydata, (b) determining that the inventory data contains relationalinventory metadata, (c) determining that the relational inventorymetadata indicates at least one related inventory record, and (d)performing step (a) on the at least one related inventory record.

In another aspect, a computer system for crawling computerinfrastructure inventory data is provided. The computer system includesa processor and a memory device coupled to the processor. The computersystem also includes an inventory database system stored on the memorydevice. The inventory database system includes computer-executableinstructions allowing the computer to manage stored records. Thecomputer system is configured to (a) retrieve an inventory record fromthe inventory database system, the inventory record containing inventorydata, (b) determine that the inventory data contains relationalinventory metadata, (c) determine that the relational inventory metadataindicates at least one related inventory record, and (d) perform step(a) on the at least one related inventory record.

In a further aspect, computer-readable storage media for crawlingcomputer infrastructure inventory data is provided. Thecomputer-readable storage media has computer-executable instructionsembodied thereon. When executed by at least one processor, thecomputer-executable instructions cause the processor to (a) retrieve aninventory record from an inventory database system, the inventory recordcontaining inventory data, (b) determine that the inventory datacontains relational inventory metadata, (c) determine that therelational inventory metadata indicates at least one related inventoryrecord, and (d) perform step (a) on the at least one related inventoryrecord.

In another aspect, a computer-implemented method for scanning computerinfrastructure within a computer network is provided. The computernetwork includes a first host device having a first operating system anda second host device having a second operating system distinct from thefirst operating system. The first host device and the second host deviceare coupled to a controller server. The method comprises the step ofdeploying a first scan program to the first host device and the secondhost device. The first scan program is configured to gather and storeinventory data on a host device. The method further comprises the stepof installing a first tool set on the first host device. The first toolset is configured to enable the first scan program to execute on thefirst operating system. The method also comprises the step of installinga second tool set on the second host device. The second tool set isconfigured to enable the first scan program to execute on the secondoperating system. The method further comprises the step of executing thefirst scan program on the first host device for gathering and storing afirst set of inventory data on the first host device. The method alsocomprises the step of executing the first scan program on the secondhost device for gathering and storing a second set of inventory data onthe second host device. The method further comprises the step ofcollecting the first set of inventory data from the first host deviceand the second set of inventory data from the second host device. Thecollecting is performed by the controller server.

In another aspect, a network-based system for scanning computerinfrastructure within a computer network is provided. The systemcomprises a first scan program configured to gather and store inventorydata on a host device. The system also comprises a first host devicecomprising a first operating system and a first tool set. The first toolset is configured to enable the first scan program to execute on thefirst operating system. The system further comprises a second hostdevice comprising a second operating system distinct from the firstoperating system and a second tool set. The second tool set isconfigured to enable the second scan program to execute on the secondoperating system. The system also comprises a controller server coupledto the first host device and the second host device. The controllerserver is configured to deploy the first scan program to the first hostdevice and the second host device. The controller server is alsoconfigured to execute the first scan program on the first host devicefor gathering and storing a first set of inventory data on the firsthost device. The controller server is further configured to execute thefirst scan program on the second host device for gathering and storing asecond set of inventory data on the second host device. The controllerserver is also configured to collect the first set of inventory datafrom the first host device and the second set of inventory data from thesecond host device.

In yet another aspect, computer-readable storage media havingcomputer-executable instructions embodied thereon are provided. Thecomputer-executable instructions, when executed by at least oneprocessor, cause the processor to deploy a first scan program to a firsthost device and a second host device. The first scan program isconfigured to gather and store inventory data on a host device. Thecomputer-executable instructions also cause the processor to install afirst tool set on the first host device. The first tool set isconfigured to enable the first scan program to execute on the firstoperating system. The computer-executable instructions further cause theprocessor to install a second tool set on the second host device. Thesecond tool set is configured to enable the first scan program toexecute on the second operating system. The computer-executableinstructions also cause the processor to execute the first scan programon the first host device for gathering and storing a first set ofinventory data on the first host device. The computer-executableinstructions further cause the processor to execute the first scanprogram on the second host device for gathering and storing a second setof inventory data on the second host device. The computer-executableinstructions also cause the processor to collect the first set ofinventory data from the first host device and the second set ofinventory data from the second host device. The collecting is performedby the controller server.

In a further aspect, a computer-implemented method for storing computerinfrastructure inventory data is provided. The method is implemented bya computing device coupled to a memory device and a database systemstored on the memory device. The database system includescomputer-executable instructions allowing the computing device to managestored records. The method includes receiving an inventory fileassociated with a scan of a host device at the computing device. Themethod also includes receiving a mapping schema associated with theinventory file at the computing device. The mapping schema comprises astructured relationship description between the inventory file and aninventory record. The method further includes translating, at thecomputing device, the inventory file to the inventory record using themapping schema. The method additionally includes updating the databasesystem with the inventory record.

In yet another aspect, a computer for storing computer infrastructureinventory data is provided. The computer includes a processor and amemory device coupled to the processor. The computer also includes adatabase system stored on the memory device. The database systemincludes computer-executable instructions allowing the computer tomanage stored records. The computer is configured to receive aninventory file associated with a scan of a host device. The computer isalso configured to receive a mapping schema associated with theinventory file. The mapping schema comprises a structured relationshipdescription between the inventory file and an inventory record. Thecomputer is further configured to translate the inventory file to theinventory record using the mapping schema and to update the databasesystem with the inventory record.

In a further aspect, computer-readable storage media for storingcomputer infrastructure inventory data is provided. Thecomputer-readable storage media has computer-executable instructionsembodied thereon. When executed by at least one processor, thecomputer-executable instructions cause the processor to receive aninventory file associated with a scan of a host device. Thecomputer-executable instructions also cause the processor to receive amapping schema associated with the inventory file wherein the mappingschema comprises a structured relationship description between theinventory file and an inventory record. The computer-executableinstructions further cause the processor to translate the inventory fileto the inventory record using the mapping schema and to update adatabase system with the inventory record.

BRIEF DESCRIPTION OF THE DRAWINGS

The Figures listed below show example embodiments of the methods andsystems described herein.

FIG. 1 is a simplified block diagram of an example embodiment of asystem for scanning infrastructure for inventory data in accordance withone embodiment of the present invention.

FIG. 2 is an expanded block diagram of an example embodiment of a serverarchitecture of a system in accordance with one embodiment of thepresent invention.

FIG. 3 illustrates an example configuration of a client system shown inFIGS. 2 and 3.

FIG. 4 illustrates an example configuration of a server system shown inFIGS. 2 and 3.

FIG. 5a is a flowchart illustrating an example process utilized by thesystem shown in FIG. 1 for scanning infrastructure for inventory data.

FIG. 5b is a flowchart illustrating another embodiment of the processshown in FIG. 5a for scanning infrastructure for inventory data.

FIG. 6 is a flowchart illustrating an example process utilized by thesystem shown in FIG. 1 for storing computer infrastructure inventorydata scanned using the process in FIG. 5 a.

FIG. 7 is a block diagram of an example computer infrastructure withrelationships between infrastructure components which may be scannedusing the process in FIG. 5a and stored using the process in FIG. 6.

FIG. 8 is a flowchart illustrating an example process utilized by thesystem shown in FIG. 1 for crawling computer infrastructure inventorydata stored using the process in FIG. 6 to identify relationships incomputer infrastructure inventory such as relationships shown in thearchitecture of FIG. 7.

DETAILED DESCRIPTION OF THE INVENTION

Described in detail herein are example embodiments of systems andmethods for scanning computing infrastructure for use in collectinginventory data associated with various types of computing hardware andsoftware. The systems and methods facilitate, for example, collectingoperating system and application inventory information about computingdevices using a common scan program and merging the inventoryinformation into a common database. A technical effect of the systemsand methods described herein include at least one of (a) automaticallycollecting inventory information from a network of heterogeneouscomputing devices; (b) standardizing the types of inventory informationcollected within a single inventory format; (c) centralizing storage ofinventory information; (d) centralizing and coordinating the regularcollection of inventory information; (e) facilitating distribution ofnew versions of scan software to enable dynamic changes in the type ofinformation collected; (f) leveraging a standard set of tools acrossplatform types to enable a single script to support multiple platforms;(g) facilitating crawling of inventory records to determine linksbetween inventory records; (h) defining links between inventory records;and (i) enabling effective inventory management using linked inventoryrecords.

More specifically, the technical effects can be achieved by performingat least one of the following steps: (a) deploying a scan program to aplurality of hosts; (b) installing on a host device a first tool setconfigured to enable the scan program to run on a first operatingsystem; (c) installing on another host device a second tool setconfigured to enable the scan program to run on a second operatingsystem; (d) executing the scan program on both host devices to generateinventory data sets; (e) collecting the inventory data sets on acontroller server; (f) deploying the scan program to host devices toreplace the scan program; (g) monitoring the operation of the scanprograms during execution; (h) receiving an inventory file associatedwith a scan of a host device; (i) receiving a mapping schema associatedwith the inventory file wherein the mapping schema comprises astructured relationship description between the inventory file and aninventory record; (j) translating the inventory file to the inventoryrecord using the mapping schema; and (k) updating the database systemwith the inventory record.

As used herein, the term “inventory information” refers to datadescribing characteristics associated with computing resources, and mayinclude information related to an operating system running on thecomputing resource, information related to an application running on thecomputing resource, information related to an application server runningon the computing resource, information related to a web server runningon the computing resource, information related to load balancer runningon the computing resource, information related to security certificateson the computing resource, information related to network hierarchies(e.g., DNS records) on the computing resource, information related tohardware security assets running on the computing resource, informationrelated to IP addresses bound to the computing resource, informationrelated to other services on the computing resource, and hardwareassociated with the computing resource.

As used herein, the terms “inventory database system,” “databasesystem,” and “inventory database” refer to database systems used tostore data associated with inventory data scanned using the methodsdescribed herein. Such database systems are further used to facilitatecrawling to find related computer infrastructure inventory components.As used herein, these terms may be used interchangeably.

As used herein, a processor may include any programmable systemincluding systems using micro-controllers, reduced instruction setcircuits (RISC), application specific integrated circuits (ASICs), logiccircuits, and any other circuit or processor capable of executing thefunctions described herein. The above examples are example only, and arethus not intended to limit in any way the definition and/or meaning ofthe term “processor.”

As used herein, the term “database” may refer to either a body of data,or to a relational database management system (RDBMS), or both. As usedherein, a database may include any collection of data includinghierarchical databases, relational databases, flat file databases,object-relational databases, object oriented databases, and any otherstructured collection of records or data that is stored in a computersystem. The above examples are example only, and thus are not intendedto limit in any way the definition and/or meaning of the term database.Examples of RDBMS's include, but are not limited to including, Oracle®Database, MySQL®, IBM® DB2, Microsoft® SQL Server, Sybase®, andPostgreSQL. However, any database may be used that enables the systemsand methods described herein. (Oracle and MySQL are registeredtrademarks of Oracle Corporation, Redwood Shores, Calif.; IBM is aregistered trademark of International Business Machines Corporation,Armonk, N.Y.; Microsoft is a registered trademark of MicrosoftCorporation, Redmond, Wash.; and Sybase is a registered trademark ofSybase, Dublin, Calif.) As used herein, the term “database system”refers specifically to a RDBMS.

In one embodiment, a computer program is provided, and the program isembodied on a computer readable medium. In an example embodiment, thesystem is executed on a single computer system, without requiring aconnection to a sever computer. In a further example embodiment, thesystem is being run in a Windows® environment (Windows is a registeredtrademark of Microsoft Corporation, Redmond, Wash.). In yet anotherembodiment, the system is run on a mainframe environment and a UNIX®server environment (UNIX is a registered trademark of X/Open CompanyLimited located in Reading, Berkshire, United Kingdom). The applicationis flexible and designed to run in various different environmentswithout compromising any major functionality. In some embodiments, thesystem includes multiple components distributed among a plurality ofcomputing devices. One or more components may be in the form ofcomputer-executable instructions embodied in a computer-readable medium.The systems and processes are not limited to the specific embodimentsdescribed herein. In addition, components of each system and eachprocess can be practiced independent and separate from other componentsand processes described herein. Each component and process can also beused in combination with other assembly packages and processes.

The following detailed description illustrates embodiments of theinvention by way of example and not by way of limitation. It iscontemplated that the invention has general application to managingcomputing infrastructures.

As used herein, an element or step recited in the singular and proceededwith the word “a” or “an” should be understood as not excluding pluralelements or steps, unless such exclusion is explicitly recited.Furthermore, references to “example embodiment” or “one embodiment” ofthe present invention are not intended to be interpreted as excludingthe existence of additional embodiments that also incorporate therecited features.

FIG. 1 is a simplified block diagram of an example inventory collectionsystem 100, including a plurality of computer devices in accordance withone embodiment of the present invention. More specifically, in theexample embodiment, system 100 includes a controller server 112 and aplurality of client sub-systems, also referred to as “hosts” 114,connected to controller server 112. In one embodiment, hosts 114 arecomputing devices communicatively coupled to controller server 112through a network 115, such an such as a local area network (LAN) or awide area network (WAN), dial-in-connections, cable modems, and specialhigh-speed Integrated Services Digital Network (ISDN) lines, or theInternet.

In the example embodiment, controller server 112 includes a databaseserver 116 connected to database 120, which contains inventoryinformation relating to hosts 114, as described below in greater detail.In one embodiment, centralized database 120 is stored on controllerserver 112 and can be accessed by potential users at one of hosts 114 bylogging onto controller server 112 through one of hosts 114. In analternative embodiment, database 120 is stored remotely from controllerserver 112.

Database 120 may include a single database having separated sections orpartitions or may include multiple databases, each being separate fromeach other. Database 120 may store inventory data generated as part ofinventory scan activities conducted over the network including datarelating to operating systems, applications, and hardware. Database 120may also store information associated with inventory scanning of hosts114, such as scan execution and scheduling information, scan sourcecode, and source code version information. Database 120 may also storeinformation associated with the storing of inventory scanned from hosts114, including standard file format layouts, scan versions, scan versiondates, discovery dates, data security information, scan locations (i.e.,relative or absolute paths on host where a scan occurs), fileconsistency information, mapping schema, scan performance information(i.e., success and failure statistics associated with scans), datahistory, error handling information, and any other information relevantto the storing of scanned information from host 114. As discussed below,inventory information associated with hosts 114 is updated periodically,and is stored within database 120.

FIG. 2 is an expanded block diagram of an example embodiment of a serverarchitecture of a computing infrastructure 122 including other computerdevices in accordance with one embodiment of the present invention.Components in computing infrastructure 122, identical to components ofsystem 100 (shown in FIG. 1), are identified in FIG. 2 using the samereference numerals as used in FIG. 1. Computing infrastructure 122includes controller server 112, hosts 114, and POS terminals 118.Controller server 112 further includes database server 116, and mayinclude a transaction server 124, a web server 126, a fax server 128, adirectory server 130, and a mail server 132. A storage device 134 iscoupled to database server 116 and directory server 130. Servers 116,124, 126, 128, 130, and 132 are coupled to a local area network (LAN)136. In addition, a first host device 138, a second host device 140, anda third host device 142 may be coupled to LAN 136. In the exampleembodiment, first host device 138, second host device 140, and thirdhost device 142 are coupled to LAN 136 using network connection 115.Alternatively, host devices 138, 140, and 142 are coupled to LAN 136using an Internet link or are connected through an Intranet. As usedherein, host devices 138, 140, and 142, as well as controller server112, are referred to, collectively, as “infrastructure” or “computinginfrastructure,” and represent computing devices whose inventory is asubject of interest to system 100. Each host device 138, 140, and 142 isa computing device having an operating system, a set of hardware, andmay include one or more applications.

Controller server 112 is configured to be communicatively coupled tovarious individuals, including employees 144 and to third parties, e.g.,account holders, customers, auditors, developers, consumers, merchants,acquirers, issuers, etc., 146 using an ISP Internet connection 148. Thecommunication in the example embodiment is illustrated as beingperformed using the Internet, however, any other wide area network (WAN)type communication can be utilized in other embodiments, i.e., thesystems and processes are not limited to being practiced using theInternet. In addition, and rather than WAN 150, local area network 136could be used in place of WAN 150.

In the example embodiment, controller server 112 includes databaseserver 116, and may include a transaction server 124, a web server 126,a fax server 128, a directory server 130, and a mail server 132. Inother embodiments, controller server 112 may further include additionalservers to facilitate the processes and methods described herein. Suchadditional servers may include, without limitation, servers forscheduling and coordinating scan activities, servers for receiving data,servers for translating data, servers for inserting data into adatabase, servers for storing database information, servers to assist insecurity tasks such as encryption and decryption, additional servers tofacilitate these functions across security zones, and any other serverwhich may facilitate the processes and methods described herein. In someembodiments, such servers may be physically distinct from one anotherwhile in other embodiments, physical servers may be used for a pluralityof purposes. In other words, while controller server 112 may indicate anindividual server or a plurality of servers, controller server 112 willbe able to facilitate the processes and methods described herein.

In the example embodiment, any authorized individual having aworkstation 154 may be a part of computing infrastructure 122. At leastone of the client systems includes a manager workstation 156 located ata remote location. Workstations 154 and 156 are personal computershaving a web browser. Also, workstations 154 and 156 are configured tocommunicate with controller server 112. Furthermore, fax server 128communicates with remotely located client systems, including a clientsystem 156 using a telephone link. Fax server 128 is configured tocommunicate with other client systems 138, 140, and 142 as well.

Computing infrastructure 122 may also include point-of-sale (POS)terminals 118, which may be connected to controller server 112. POSterminals 118 are interconnected to the Internet through many interfacesincluding a network, such as a local area network (LAN) or a wide areanetwork (WAN), dial-in-connections, cable modems, wireless modems, andspecial high-speed ISDN lines. POS terminals 118 could be any devicecapable of interconnecting to the Internet and including an input devicecapable of reading information from a consumer's financial transactioncard.

FIG. 3 illustrates an example configuration of a user system 202operated by a user 201, such as a system administrator. User system 202may include, but is not limited to, hosts 114, 138, 140, and 142, POSterminal 118, workstation 154, and manager workstation 156. In theexample embodiment, user system 202 includes a processor 205 forexecuting instructions. In some embodiments, executable instructions arestored in a memory area 210. Processor 205 may include one or moreprocessing units, for example, a multi-core configuration. Memory area210 is any device allowing information such as executable instructionsand/or written works to be stored and retrieved. Memory area 210 mayinclude one or more computer readable media.

User system 202 also includes at least one media output component 215for presenting information to user 201. Media output component 215 isany component capable of conveying information to user 201. In someembodiments, media output component 215 includes an output adapter suchas a video adapter and/or an audio adapter. An output adapter isoperatively coupled to processor 205 and operatively couplable to anoutput device such as a display device, a liquid crystal display (LCD),organic light emitting diode (OLED) display, or “electronic ink”display, or an audio output device, a speaker or headphones.

In some embodiments, user system 202 includes an input device 220 forreceiving input from user 201. Input device 220 may include, forexample, a keyboard, a pointing device, a mouse, a stylus, a touchsensitive panel, a touch pad, a touch screen, a gyroscope, anaccelerometer, a position detector, or an audio input device. A singlecomponent such as a touch screen may function as both an output deviceof media output component 215 and input device 220. User system 202 mayalso include a communication interface 225, which is communicativelycouplable to a remote device such as controller server 112.Communication interface 225 may include, for example, a wired orwireless network adapter or a wireless data transceiver for use with amobile phone network, Global System for Mobile communications (GSM), 3G,or other mobile data network or Worldwide Interoperability for MicrowaveAccess (WIMAX).

Stored in memory area 210 are, for example, computer readableinstructions for providing a user interface to user 201 via media outputcomponent 215 and, optionally, receiving and processing input from inputdevice 220. A user interface may include, among other possibilities, aweb browser and client application. Web browsers enable users, such asuser 201, to display and interact with media and other informationtypically embedded on a web page or a website from controller server112. A client application allows user 201 to interact with a serverapplication from controller server 112.

FIG. 4 illustrates an example configuration of a server system 301 suchas controller server 112 (shown in FIGS. 2 and 3). Server system 301 mayinclude, but is not limited to, database server 116, transaction server124, web server 126, fax server 128, directory server 130, and mailserver 132.

Server system 301 includes a processor 305 for executing instructions.Instructions may be stored in a memory area 310, for example. Processor305 may include one or more processing units (e.g., in a multi-coreconfiguration) for executing instructions. The instructions may beexecuted within a variety of different operating systems on the serversystem 301, such as UNIX®, LINUX, Microsoft Windows®, etc. It shouldalso be appreciated that upon initiation of a computer-based method,various instructions may be executed during initialization. Someoperations may be required in order to perform one or more processesdescribed herein, while other operations may be more general and/orspecific to a particular programming language (e.g., C, C#, C++, Java,or other suitable programming languages, etc.).

Processor 305 is operatively coupled to a communication interface 315such that server system 301 is capable of communicating with a remotedevice such as a user system or another server system 301. For example,communication interface 315 may receive requests from hosts 114 via theInternet, as illustrated in FIGS. 1 and 2.

Processor 305 may also be operatively coupled to a storage device 134.Storage device 134 is any computer-operated hardware suitable forstoring and/or retrieving data. In some embodiments, storage device 134is integrated in server system 301. For example, server system 301 mayinclude one or more hard disk drives as storage device 134. In otherembodiments, storage device 134 is external to server system 301 and maybe accessed by a plurality of server systems 301. For example, storagedevice 134 may include multiple storage units such as hard disks orsolid state disks in a redundant array of inexpensive disks (RAID)configuration. Storage device 134 may include a storage area network(SAN) and/or a network attached storage (NAS) system.

In some embodiments, processor 305 is operatively coupled to storagedevice 134 via a storage interface 320. Storage interface 320 is anycomponent capable of providing processor 305 with access to storagedevice 134. Storage interface 320 may include, for example, an AdvancedTechnology Attachment (ATA) adapter, a Serial ATA (SATA) adapter, aSmall Computer System Interface (SCSI) adapter, a RAID controller, a SANadapter, a network adapter, and/or any component providing processor 305with access to storage device 134.

Memory area 310 may include, but are not limited to, random accessmemory (RAM) such as dynamic RAM (DRAM) or static RAM (SRAM), read-onlymemory (ROM), erasable programmable read-only memory (EPROM),electrically erasable programmable read-only memory (EEPROM), andnon-volatile RAM (NVRAM). The above memory types are example only, andare thus not limiting as to the types of memory usable for storage of acomputer program.

FIG. 5a is a flowchart 500 illustrating an example process utilized bysystem 100 (shown in FIG. 1) for scanning infrastructure for inventorydata. Components in flowchart 500, identical to components of system 100(shown in FIG. 1) and computing infrastructure 122 (shown in FIG. 2),are identified in FIG. 5a using the same reference numerals as used inFIGS. 1 and 2. In the example embodiment, controller server 112 (shownin FIGS. 1 and 2) deploys and executes a scan program 501 to collectinventory data on computing devices 114 in a computing infrastructure,such as computing infrastructure 122. Controller server 112 communicateswith computing devices 114 such as first host device 138 and second hostdevice 140 through a network, such as network 115, a local area network(LAN), an intranet (i.e., a private network spanning geographic areas),and the Internet.

During operation, controller server 112 deploys 502 the scan program 501to a plurality of hosts, such as first host device 138 and second hostdevice 140. In some embodiments, deploying 502 the scan program 501includes using file transfer protocol (FTP) to transfer the scan program501 to the host. In other embodiments, deploying 502 the scan program501 includes using secure file transfer protocol (SFTP) to transfer thescan program 501 to the host. Alternatively, any method of transferringthe scan program 501 to a host that enables operation of the systems andmethods described herein may be used.

In the example embodiment, a first tool set 520 is installed 504 ontofirst host device 138, and a second tool set 522 is installed 506 ontosecond host device 140. First host device 138 includes a first operatingsystem, and second host device 140 includes a second operating systemthat is different from the first operating system. The first tool set520 and second tool set 522 are sets of tools configured to allowexecution a single scan program, e.g., the scan program 501, undermultiple operating systems. For example, first host device may include aWindows®-based operating system, and second host device may include aSolaris®-based operating system. (Solaris is a registered trademark ofOracle Corporation, Redwood City, Calif.). In one embodiment, the scanprogram 501 is one or more scripts written in Perl. As such, the firsttool set 520 may include a Perl interpreter for Windows®, therebyenabling scan program 501 to execute on the first host device 138.Further, the second tool set 522 may include a Perl interpreter forSolaris®, thereby enabling scan program 501 to execute on the secondhost device 140. In another embodiment, scan program 501 includes anyother computer language, and the tool set (520 or 522) includes acorresponding interpreter for interpreting the computer language. Inother words, first tool set 520 and the second tool set 522 includes anyoperating system specific libraries and binaries necessary to perform asuccessful scan. For example, the use of a scan program written in aparticular language (e.g., Java) would cause an associated tool set toinclude a corresponding interpreter (e.g., a Java interpreter). Inanother example, the use of cryptographic or security protocols in ascan would cause an associated tool set to include an associated programfor extracting encrypted data (e.g., OpenSSL).

Further, in the example embodiment, scan program 501 includes operatingsystem-specific operations conditioned, at runtime, to execute certaininventory collection functions using different mechanisms, depending onthe operating system detected. For example, scan program 501 may firstdetermine the operating system of the host, such as Windows Server 2003or Solaris, by querying the execution host. As used herein, the term“execution host” refers to the particular host upon which an instance ofthe scan program 501 is running. In another embodiment, the scan programmay be instructed at to the underlying operating system of the executionhost, such as through a command-line parameter at the time of execution,or through an environment variable, or a configuration file associatedwith the scan program on the particular execution host. For example, ifthe scan program utilizes Perl, and if the operating system of theexecution host device is stored in the variable “$^0”, such as throughpassing the operating system type as a command line parameter duringexecution, the scan program 501 may set the operating system platform,here “$platform”, as such:

if(${circumflex over ( )}O =~ /solaris/i){ $platform=“solaris”; }elsif(${circumflex over ( )}O =~ /aix/i){ $platform=“aix”; }elsif(${circumflex over ( )}O =~ /windows/i ∥ ${circumflex over ( )}O =~/cygwin/i){ $platform=“windows”; } elsif(${circumflex over ( )}O =~/linux/i){ $platform=“linux”; }

Later during execution, in this example embodiment, scan program 501utilizes the “$platform” variable to conditionally process certaininventory information. For example, the scan program 501 may beprogrammed to query the execution host to determine memory information.If the execution host's operating system is Solaris, the scan program501 may execute a system command including the “uname” command, andparse Solaris-style output of “uname” in the format expected fromSolaris. For example:

elsif($platform eq ″solaris″){ print ″INFO: Running on a Solarishost\n”; #Gather uname -a output here print ″DEBUG: Gathering unameinformation\n”; my $unameOut={grave over ( )}uname -a{grave over ( )}; .. . }Or if the execution host's operating system is Linux, the scan program501 may execute a series of system commands including various “uname”commands, and parse Linux-style “uname” output in the format expectedfrom Linux. For example:

elsif($platform eq “linux”){ #Running on Linux print “INFO: Running on aLinux host\n”; #Gather uname output here print “DEBUG: Gathering unameinformation\n”; chomp($hostHash{OS}={grave over ( )}uname -o{grave over( )}); chomp($hostHash{HardwareType}={grave over ( )}uname -i{grave over( )}); chomp($hostHash{ServerType}={grave over ( )}uname -m{grave over( )}); chomp($hostHash{ProcessorType}={grave over ( )}uname -p{graveover ( )}); chomp($hostHash{OSVersion}={grave over ( )}uname -r{graveover ( )}); chomp($hostHash{BuildVersion}={grave over ( )}uname -v{graveover ( )}); . . . }As such, the scan program 501 is able to collect similar data fromdifferent operating systems using a single scan program.

Controller server 112 then executes 508 scan program 501 on the firsthost device 138 for generating a first set of inventory data 530. In theexample embodiment, scan program 501 is configured to gather inventorydata associated with the execution host device, including but notlimited to information related to hardware components, operating systeminformation, and application information. For example, certain hardwarecomponent information such as the number of processors, storagecapacities, and random access memory (RAM) capacities may be collectedby the scan program 501. Further, operating system information such asoperating system version, patch level information, network addresses,and user information may be collected by the scan program 501.

In some embodiments, information about installed applications may begathered as well, such as the presence of certain applications, thesettings of applications, and application-specific information such asversion and patch information, application configuration settings, andnetwork port settings. Further, in some embodiments, the scan programs501 and/or sub-components of the scan programs 501, are executed as aspecific user of the operating system of the execution host. Operatingsystems may include security control restrictions by means of useraccounts (“users”) and authentication. Some inventory data may beaccessible to some users, e.g., “privileged” users may have accesspermission to some inventory data, while access may be restricted toother users, e.g., “non-privileged” users. For example, a Mysql®database may have a privileged user account, “mysql”, through whichdetailed inventory information about the database may be obtained.Non-privileged users of the operating system, on the other hand, may nothave access to the detailed inventory information of the database. Thus,in some configurations, scan program 501 may necessitate execution ofinventory information collection through use of one or more privilegeduser accounts. Moreover, in some embodiments, the controller server 112may execute some inventory collection as a particular user using toolsfor executing commands as particular users, such as, for example, the“sudo” utility in a Unix-based operating system. This example embodimentmay require a tool installation and/or configuration change.

Similarly, controller server 112 also executes 510 scan program 501 onthe second host device 140 for generating a second set of inventory data530. Each set of inventory data, such as first set of inventory data 530and second set of inventory data 532, is stored locally on the executionhost, such as first host device 138 and second host device 140,respectively. After a set of inventory data, such as the first set ofinventory data 530, is created by the scan program 501, controllerserver 112 collects 512 the inventory files and stores the inventorydata in the database 120. The format of these sets of inventory data, aswell as the parsing and storing of the data in database 120, isdiscussed in greater detail below.

During operation, controller server 112 orchestrates deployment of scanprograms 510. In some embodiments, subsequent versions of scan programsmay be deployed to replace previous versions. Controller server 112 maydelete, overwrite, or otherwise replace a prior version of the scanprogram with a new scan program. Subsequent executions of thelater-version scan programs may gather information different from theearlier-version scan programs. Each execution of the scan programs 501may be alterable by parameters of the execution, or through aconfiguration file. In some embodiments, controller server 112 deploysand manages configuration files associated with the scan program 501. Insome embodiments, controller server 112 deploys an agent to theexecution host, such as first host device 138 and second host device140, respectively. The agent represents a program which is installed onthe execution host which executes the scan when contacted by controllerserver 112. Controller server 112 may contact the agent on a scheduledbasis. In other embodiments, controller server 112 does not use an agenton the execution host. Instead, controller server 112 will have theability to log into the execution host remotely (e.g., by using logincredentials such as an SSH keypair). Controller server 112 logs into theexecution host to execute the scan.

Further, controller server 112 orchestrates scheduling and execution ofeach scan program ran on each computing device 114, such as first hostdevice 138 and second host device 140. Controller server 112 may useseveral methods for scheduling the execution of each scan program oneach computing device 114, such as OS-supplied scheduling tools,enterprise scheduling software, and custom-written scheduling software.In one example, controller server 112 may use scheduling softwareincluded in an operating system distribution. Operating systems ofteninclude such scheduling software (e.g., UNIX cron) which allows forcommands to be executed at specific times. In another example,controller server 112 may use enterprise scheduling software. Enterprisescheduling software includes software designed to schedule commandexecution and track command execution success in a robust fashion. Forexample, enterprise scheduling software may include reporting views oralerts to allow for scalable scheduling of command execution. In anadditional example, controller server 112 may use custom schedulingmethods. For instance, most programming languages include libraries andapplication programming interfaces (APIs) which may be used to scheduletasks. In this example, such libraries and APIs are used to create anapplication which is used to manage and schedule tasks.

In addition, controller server 112, in the example embodiment, monitorsthe execution of each scan program ran on each system. Controller server112 may alter an aspect of operation of the scan program, such asterminating and/or re-executing the scan programs. In some embodiments,controller server 112 maintains a “timeout period” for each running scanprogram. If a scan program runs longer than the timeout period, thecontroller server 112 terminates the scan program. Such a timeout periodmay be configured by individual scan, by a group of scans, or as asingle default for all scans. The controller server 112 may then collectthe partial output from the failed scan, and may analyze the output forerror. Execution output of the scan program may be directed to logfiles, and controller server 112 may collect and analyze those log filesfor success and/or failure indications. Timestamps may be implemented tofacilitate troubleshooting. In some embodiments, controller server 112may use an agent program to facilitate tasks including, withoutlimitation, monitoring scan programs, terminating scan programs,executing scan programs, changing users, detecting timeouts, andre-executing scan programs. The agent program is deployed by controllerserver 112 and resident on each system. The agent program can be calledby controller server 112 to facilitate the tasks described above.

In the example embodiment, controller server 112 maintains the outputfiles associated with the scan programs. Controller server 112 maychange file permissions of the output files, such as ownership, and maydelete and/or archive old files. An example output file is illustratedin this example:

<?xml version=“1.0” encoding=“utf-8” ?> <scan date=“20120922”host=“was2stl61” table=“Host” version=“2.3.4.DR”> <Host><BuildVersion>Generic_144488-06</BuildVersion> <CPUSpeed>1415</CPUSpeed><HardwareType>SUNW,SPARC-Enterprise-T5220</HardwareType><Host>DC1-host1</Host> <OS>SunOS</OS> <OSVersion>5.10</OSVersion><ProcessorType>sparcv9</ProcessorType> <RealMemory>32G</RealMemory><ServerType>sparc</ServerType> <TotalNumberCPU>64</TotalNumberCPU><TotalSwapSpace>1673.4921875M</TotalSwapSpace> </Host> </scan>

FIG. 5b is a flowchart 550 illustrating another embodiment of theprocess shown in FIG. 5a for scanning infrastructure for inventory data.Components in flowchart 550, identical to components of system 100(shown in FIG. 1), computing infrastructure 122 (shown in FIG. 2), andflowchart 500 (shown in FIG. 5a ), are identified in FIG. 5b using thesame reference numerals as used in FIGS. 1, 2, and 5 a. In this exampleembodiment, local scan controllers 552 and one or more sub-scans 554,556 are deployed 502 to host devices 138, 140. Local scan controller 552orchestrate local execution of one or more sub-scans 554, 556 duringexecution 508, 510. Each of the sub-scans 554, 556 collect andcontribute a subset of the inventory results 530, 532. In someembodiments, sub-scans 554, 556 may be delineated by individualapplication, such as an “Apache” sub-scan for collecting informationrelated to an Apache installation on the local host 140. In otherembodiments, sub-scans 554, 556 may be delineated based on userprivilege, such as when a subset of information needs to be collected bya specific user account. In still other embodiments, sub-scans 554, 556may be delineated based on time of day, type of information to becollected, user-specific request, or any other appropriateorganizational scheme. The results of each sub-scan 554, 556 are storedin inventory sets 530, 532.

FIG. 6 is a flowchart illustrating an example process utilized by system100 (shown in FIG. 1) for storing computer infrastructure inventory datascanned using process 500 (shown in FIG. 5a ). Components in flowchart600, identical to components of system 100 (shown in FIG. 1) andcomputing infrastructure 122 (shown in FIG. 2), are identified in FIG. 6using the same reference numerals as used in FIGS. 1 and 2. In theexample embodiment, controller server 112 receives 610 inventory files601 and 602 from host devices 138 and 140. In alternative embodiments,controller server 112 may receive inventory files any of host devices114, 140, 142 (shown in FIG. 2), or other similar host devices. Althoughtwo host devices are shown in FIG. 6, any number of host devices may beused with the process of FIG. 6 and the same number of inventory fileswill be received by controller server 112. Controller server 112communicates with computing devices such as host device 138 through anetwork, such as network 115 (shown in FIG. 2), a local area network(LAN), an intranet (i.e., a private network spanning geographic areas),and the Internet.

In the example embodiment, host devices 138 and 140 represent twoheterogeneous host devices 138 and 140. The heterogeneity of hostdevices 138 and 140 may be reflected in different operating systems,resident applications, computer hardware, or any other distinction. Inthe example embodiment, host device 138 runs a Linux-based operatingsystem and host device 140 runs a Solaris-based operating system.Further, host device 138 is used primarily to serve applications byusing an application server (e.g., Apache TomEE) while host device 140is used primarily to serve web content by using a web server (e.g.,Apache HTTP). (Apache, Apache HTTP, and Apache TomEE are trademarks ofApache Software Foundation, Los Angeles, Calif.) Additionally, hostdevices 138 and 140 run using different physical hardware includingdistinct manufacturers and architectures.

FIG. 6 illustrates one example of storing computer infrastructureinventory data. In other examples additional host devices 138 or 140 maybe scanned and have associated inventory data stored. In some examples,the host devices may be completely homogeneous, completelyheterogeneous, or partly heterogeneous.

Inventory files 601 and 602 represent an inventory files created bycontroller server 112 as inventory sets 530 or 532 (shown in FIG. 5a ).In the example embodiment, receiving 610 inventory files 601 and 602includes collecting 512 (shown in FIG. 5a ) inventory sets. In theexample embodiment, inventory files 601 and 602 use a standardized fileformat that may be interpreted and written on a plurality of operatingsystems. However, the data and layout of inventory files 601 and 602 mayvary significantly depending upon differences between host devices 138and 140 and scans running on host devices 138 and 140. In other words,while inventory files 601 and 602 may be written and read on a pluralityof operating systems, the data and layout may not be consistent. Suchdata may be generated using PHP objects persisted to file, XML data,Perl data persisted to file, or any other file format. Therefore,inventory files 601 and 602 may be heterogeneous in accordance with theheterogeneity of host devices 138 and 140.

In the example embodiment, inventory files 601 and 602 include datagenerated from a scan of host device 138 including the scan date and thedatabase table associated with the scan. The scan date represents thedate that the scan was executed. In at least some embodiments, scan dateincludes a time along with the date. The database table associated withthe scan represents the database table that inventory files 601 and 602should be associated with and written to.

In at least some embodiments, inventory files 601 and 602 are reviewedto determine the scan date. In such embodiments, controller server 112will determine whether inventory files 601 and/or 602 are outside of arequired date range. The required date range may be set by user 201(shown in FIG. 3) of controller server 112 or by system defaults. Inthese embodiments, inventory files 601 and/or 602 will be purged if itis determined that either or both are outside of the required daterange. Further, controller server 112 will request new inventory files601 and/or 602.

In alternative embodiments, inventory files 601 and 602 include a scanversion. The scan version represents the version of the scanningalgorithm used to create inventory files 601 and 602. Scan versions mayinclude identifiers written into the scan program 501 (shown in FIG. 5a) including alphanumeric identifiers or any other identifier which mayuniquely represent a scan version. Scan versions may indicate changes inthe scope or features of the scanning algorithm.

In at least some embodiments, inventory files 601 and 602 are reviewedto determine scan versions. In such embodiments, controller server 112will determine whether inventory files 601 and/or 602 match acceptablescan versions. The acceptable scan versions may be set by user 201 ofcontroller server 112 or by system defaults. In these embodiments,inventory files 601 and/or 602 will be purged if it is determined thateither or both do not match acceptable scan versions. Further,controller server 112 will request new inventory files 601 and/or 602.In some examples, controller server 112 will redeploy scan program 501to hosts 138 and/or 140 to ensure that a newly executed scan will matchthe acceptable scan versions.

In the example embodiment, controller server 112 receives 610 inventoryfiles 601 and 602 from a predetermined location on host device 138 andhost device 140. The predetermined location is a constant, persistentlocation for inventory files 601 and 602 to be located on a plurality ofhosts 138 and 140. The predetermined location represents a persistentrelative or absolute path with respect to the host. For example, thepredetermined location may be in a fixed location such as “C:\scans” ora persistent relative path such as, “localhost\localdata\scans.” Apredetermined location may be of use in storing inventory data becauseit may avoid any difficulties in locating inventory files 601 and 602.In alternative embodiments, a location other than a predeterminedlocation may be used in receiving 610 inventory files 601 and 602. Insuch embodiments, the location is a fordable location within host device138 and/or 140.

In the example embodiment, receiving 610 inventory files 601 and 602includes using an automated process. The automated process may be anyprocess using programs or methods for scheduling, (e.g., a cron job,scheduling software, scripts, batch processes or any other method forautomating the transmission and receipt 610 of inventory files 601 and602. Automated processes may be useful to ensure that all inventoryfiles 601 and 602 are received consistently.

In the example embodiment, receiving 610 inventory files 601 and 602includes using a secure file transfer method. Using a secure filetransfer method may include using SFTP, SSH, Secure Copy, FASP, or anyother method capable of transmitting data in a secure manner.

In the example embodiment, receiving 610 inventory files 601 and 602includes using a method to ensure file consistency during transfer.Ensuring file consistency during transfer may be of importance due todata corruption, manipulation, interrupted communication, or any otherissue that may result in inconsistency in files between transmission andreceipt. The method of ensuring file consistency includes receiving hostfile characteristics of inventory files 601 and 602 on host devices 138and 140. The method also includes determining local file characteristicsof inventory files 601 and 602 on controller server 112. The methodfurther includes comparing file characteristics of host devices 138 and140 to local file characteristics. The method additionally includesrequesting a retransmission of inventory file 601 if host filecharacteristics do not match local file characteristics.

Controller server 112 next receives 620 mapping schema 621 and 622 fromhost devices 138 and 140. Mapping schema 621 and 622 are associated withinventory files 601 and 602, respectively. Mapping schema 621 and 622represents data explaining the layout of inventory files 601 and 602.Mapping schema 621 and 622 may include, without limitation, an XML file,an HTML file, a text file, or any other file format capable ofdescribing inventory files 601 and 602. Mapping schema 621 and 622allows controller server 112 to interpret the significance of data ininventory files 601 and 602 with respect to the scanning of host devices138 and 140. In other words, mapping schema 621 and 622 is a descriptionof a structured relationship between inventory files 601 and 602 andinventory records 631 and 632, respectively (described further below).

In some embodiments, controller server 112 receives 620 mapping schema621 and 622 by extracting data from inventory files 601 and 602,respectively. In these embodiments, inventory files 601 and 602 are“self-describing.” In other words, inventory files 601 and 602 containdata explaining their layouts. In these embodiments, data is extractedfrom inventory files 601 and 602 and used to interpret the significanceof data in inventory files 601 and 602 with respect to the scanning ofhost devices 138 and 140.

Because the layout and data of inventory files 601 and 602 may vary,mapping schema 621 and 622 may similarly vary. For instance, mappingschema 621 may indicate the presence of scan data relevant to anapplication server (e.g., listener ports, applications, or data sources)which are present in inventory file 601 but not inventory file 602.Conversely, mapping schema 622 may indicate the presence of scan datarelevant to a web server (e.g., virtual hosts) which are present ininventory file 602 but not inventory file 601. Further, the layout ofinventory files 601 and 602 may vary because inventory file 601 iswritten with persistent PHP objects while inventory file 602 is writtenusing XML. Accordingly, mapping schema 621 will indicate a layoutassociated with persistent PHP objects while mapping schema 622 willindicate a layout associated with XML.

Controller server 112 further translates 630 inventory files 601 and 602to inventory records 631 and 632 using mapping schema 621 and 622,respectively. In the example embodiment, controller server 112translates 630 using a program capable of receiving inventory files 601and 602, processing inventory file 601 and 602 using mapping schema 621and 622, and outputting an inventory records 631 and 632. The programmay include, without limitation, Java, Perl, JDBC, ODBC, scriptinglanguages, C#, or any other methods capable of being used to receive andinterpret a structured file and outputting inventory records 631 and632.

Translating 630 inventory files 601 and 602 to inventory records 631 and632 also represents converting data that may be heterogeneous tohomogeneous inventory records 631 and 632. As described above, hostdevices 138 and 140 may vary in many respects including, withoutlimitation, operating systems, physical hardware, applications,function, and data. Also as described above, inventory files 601 and 602may vary due to these reasons and/or divergent scanning methods.However, inventory records 631 and 632, while having potentiallyheterogeneous data, can be entered into a shared database system,inventory database system 641. Therefore, translating 630 representsallowing for heterogeneous data represented in heterogeneous datalayouts and scanned from heterogeneous host devices to be represented ina homogeneous database, inventory database system 641.

In the example embodiment, the program used to translate 630 inventoryfiles 601 and 602 writes a record in inventory records 631 and 632 forthe date and time of discovery of host devices 138 and 140 if a recordfor the date and time of discovery of host devices 138 and 140 did notpreviously exist. As used herein, discovery refers to the first timethat host devices 138 and 140 were detected using the scanning methodsdescribed herein.

In the example embodiment, the program used to translate 630 inventoryfiles 601 and 602 writes a record in inventory records 631 and 632 forthe date and time of the update of inventory records 631 and 632. Thedate and time of the update represent the date and time of the scan ofhost devices 138 and 140.

Controller server 112 next updates 640 inventory database system 641with inventory records 631 and 632. Updating 640 represents insertinginventory records 631 and 632 into inventory database system 641.Updating 640 may use any relevant method including SQL, ODBC, JDBC, orany other method capable of inserting a database record into a database.

In the example embodiment, controller server 112 determines if update640 was successful. Determining if update 640 was successful may beaccomplished by trapping for received messages from inventory databasesystem 641, querying inventory database system 641 after update 640, orany other method for verifying inventory database system 641 wassuccessfully updated 640. In such embodiments, controller server 112purges inventory files 601 and 602 from memory 210 (shown in FIG. 2)upon determining update 640 was successful.

In the example embodiment, controller server 112 also includes errorhandling methods. Error handling methods represent modules or programsthat can determine if an error has occurred in process 600 including,without limitation, data corruption of inventory files 601 and 602 ormapping schema 621 and 622, security issues in transferring inventoryfiles 601 and 602 or mapping schema 621 and 622, unacceptable scanversions or scan dates for inventory files 601 and 602, failure totranslate 630 inventory files 601 and 602 to inventory records 631 and632, or failure to update 640 inventory database system 641 withinventory records 631 and 632. Error handling methods also may becapable of responding by reinitiating a scan, a request for inventoryfiles 601 and 602, a request for mapping schema 621 and 622, translating630 inventory files 601 and 602, updating 640 inventory database system641, or alerting user 201 of the existence of an error.

FIG. 7 is a block diagram of an example computer infrastructure 700 withrelationships between infrastructure components which may be scannedusing the process in FIG. 5a and stored using the process in FIG. 6.Computer infrastructure 700 is an architecture that may be used to servea website (not shown).

Incoming requests from a web user enter computer infrastructure 700 at afirst load balancer 702. First load balancer 702 sends requestinformation to one of two web systems, first web system 706 and secondweb system 708. First web system 706 and second web system 708 arephysical or virtual systems which contain software capable of servingweb content to the web user. In this example, a relationship existsbetween first load balancer 702 and each of first web system 706 andsecond web system 708. Any of these inventory components, when scanned,may contain information indicating that they are associated with theothers. For example, an inventory file 601 (shown in FIG. 6) createdafter a scan of first load balancer 702 may contain information relatedto the hosts to which it can route traffic. Therefore, inventory file601 generated by a scan of first load balancer 702 may contain dataindicating that it load balances (i.e., it directs traffic efficiently)to first web system 706 and second web system 708. Such data may includea reference to the hostname associated with first web system 706 andsecond web system 708, an IP address associated with first web system706 and second web system 708, or any other identifying informationwhich can uniquely identify first web system 706 and second web system708. Similarly, an inventory file 601 created after a scan of first websystem 706 may indicate that it receives traffic from load balancer 702.

An inventory file 601 created after a scan of first web system 706 mayalso indicate that it serves web content with other servers. Inventoryfile 601 associated with first web system 706 may contain specificreferences to all of the web systems which can serve content similar tofirst web system 706. As these web systems share a similar function,knowledge of such relationships may be valuable. Accordingly, aninventory file 601 created after a scan of first web system 706 maycontain data indicating that it serves web content in a group includingsecond web system 708.

Web systems include software which are able to serve web content andcall upon web applications which may be required to properly servecontent. Such software is known as a web server. First web system 706and second web system 708 include first web server 712 and second webserver 714, respectively. As discussed above, applications may producetheir own scan data. Accordingly, a relationship between web systems andweb applications may be written on their respective inventory files 601.For example, an inventory file 601 created after a scan of first websystem 706 may include data indicating that it is hosting first webserver 712. Alternately, an inventory file 601 created after a scan offirst web server 712 may indicate that it is hosted on first web system706. In either case, first web system 706 or first web server 712 areidentified as described above (e.g., using a unique identifier).

Web servers include virtual hosts which receive traffic directed by aload balancer. Virtual hosts allow web servers to serve distinct websites from the same physical machine in a manner that is transparent tothe user. In this example, incoming requests are routed through firstload balancer 702 to one of first virtual hosts 722 associated withfirst web server 712 or second virtual hosts 724 associated with secondweb server 724. A relationship between virtual hosts and web servers maybe written on their respective inventory files. For example, aninventory file 601 created after a scan of first web server 712 mayindicate that it is hosting first virtual hosts 722 while an inventoryfile 601 created after a scan of second web server 714 may indicate thatit is hosting second virtual hosts 724.

In typical web serving architecture such as computer architecture 700,application content may be served in conjunction with web content. Insuch architectures, a particular virtual host may send a request for anapplication to be run and served to provide such application content. Incomputer architecture 700, first virtual hosts 722 and second virtualhosts 724 can route requests for application content to second loadbalancer 704. Second load balancer 704 will route traffic to one offirst application system 732 and second application system 734.Relationships between virtual hosts 722 and 724, second load balancer704, and application systems 732 and 734 may be written to theirrespective inventory files 601 consistent with the manner describedabove.

Application systems represent physical or virtual systems which can hostapplication servers which may receive application requests and executesuch requests. For example, first application system 732 hosts and runsfirst application server 736. Accordingly, a relationship exists betweenfirst application system 732 and first application server 736. Ananalogous relationship exists between second application system 734 andsecond application server 738. Such relationships may be written totheir respective inventory files 601 consistent with the mannerdescribed above.

Application servers include listeners which can receive applicationrequests, applications which execute such application requests, anddatasources which may call external data. Datasources allow applicationsto refer to data for a variety of purposes including personalizationspecific to the user, incorporating external data into applications, andfacilitating transactions in an application. For example, firstapplication server 736 includes first listener 742, first application746, and first datasource 752. Accordingly relationships exist betweenfirst application server 736 and first listener 742, first application746, and first datasource 752. An inventory file 601 created after ascan of first application server 736 may include data indicating firstlistener 742, first application 746, and first datasource 752.Conversely, scans of each of first listener 742, first application 746,and first datasource 752 may create inventory files 601 which indicatethat they are hosted on first application server 736.

Application servers connect to database systems by using datasourcereferences. For instance, first application server 736 may make a callto database system 762, and specifically to first database 764, basedupon first datasource 752. Accordingly, relationships exist between atleast datasources 752 and 754 and database system 762. An inventory file601 created after a scan of first datasource 752 may include dataindicating database system 762.

Database systems may host multiple databases. For example databasesystem 762 hosts first database 764 and second database 766.Accordingly, relationships exist between database system 762 anddatabases 764 and 766. An inventory file 601 created after a scan ofdatabase system 762 may include data indicating first database 764 andsecond database 766.

FIG. 8 is a flowchart illustrating an example process 800 utilized bysystem 100 (shown in FIG. 1) for crawling computer infrastructureinventory data stored 600 (shown in FIG. 6) to identify relationships incomputer infrastructure inventory such as relationships shown inarchitecture 700 (shown in FIG. 7). Process 800 is executed bycontroller server 112. As indicated above, controller server 112 (shownin FIG. 2) may indicate the same physical system as shown in FIGS. 1 and2 or a different physical system capable of executing process 800.

In the example embodiment, controller server 112 retrieves 810 aninventory record 814 from an inventory database system 812. Inventoryrecord 814 represents a data record such as inventory record 631 or 632(shown in FIG. 6) generated by translating an inventory file 601 (shownin FIG. 6) based upon a scan of a host such as host device 138 or 140(shown in FIG. 6). Inventory database system 812 represents a databasefor storing inventory records 814 such as inventory database system 641(shown in FIG. 6). As described above, inventory record 814 may containany information which is generated based upon a scan of a host such ashost device 138 or 140. Inventory record 814 contains inventory datarepresentative of information which may be parsed from inventory record814 to determine information regarding the host such as scanned hostdevice 138 or 140. Such inventory data may include relational metadata822.

In the example embodiment, inventory record 814 is stored as a uniquetable in inventory database system 812. In other embodiments, inventoryrecord 814 may be a non-unique table, a row within a table, or any otherdata structure capable of being stored in inventory database system 812and used as described herein.

Relational metadata 822 represents information which indicates thatinventory record 814 is related to other inventory records. Arelationship between inventory record 814 and other inventory recordsfurther indicates that the inventory associated with inventory record814 is associated with inventory associated with the other inventoryrecords. For example, in computer architecture 700, first web server 712may be scanned to produce inventory file 601. As described above, firstweb server 712 is related to first web system 706 as first web server712 is installed upon and running on first web system 706. The inventoryfile 601 generated by scanning first web server 712 may accordinglyinclude data such as:

-   -   ServerName: www.websiteserved.com    -   Instance: First Web Server 712    -   Host: First Web System 706    -   Version: 1.1.1    -   Document Root: /docs    -   Discovered: 2013-03-15 11:11:11    -   Updated: 2013-04-01 11:11:11

As described above, after scanning, the inventory file 601 of the aboveexample is translated to inventory record 631 using mapping schema 621(both shown in FIG. 6). When controller server 112 retrieves inventoryrecord 814 (corresponding to inventory record 631) from inventorydatabase system 812, inventory record 814 will contain relationalmetadata 822. Specifically, relational metadata 822 is represented by“Host: First Web System 706.” This relational metadata 822 indicatesthat first web system 706 (and its associated inventory file 601 andinventory record 814) are related to first web server 712.

Controller server 112 determines 820 that inventory data containsrelational metadata 822. In the example embodiment, determining 820 thatinventory data contains relational metadata 822 includes identifyingprimary relational columns known to contain values corresponding torelational inventory metadata and determining if primary relationalcolumns have non-null values. In the above example, data represented ininventory file 601 as “Host: First Web System 706” will alwaysassociate, if non-null, to a host. In the example embodiment, all hostsare scanned. Further, when inventory file 601 is translated to inventoryrecord 631, “Host: First Web System 706” may translate to an inventoryrecord 631 with a column header of “Host” and a value of “First WebSystem 706” which is stored in a row specific to first web server 712.Therefore, the presence of a column “Host” is a primary relationalcolumn known to contain values corresponding to relational inventorymetadata 822. In other words, if a record has a non-null value for thecolumn corresponding to “Host,” the record is known to containrelational inventory metadata 822 and therefore relate to anotherinventory record 814.

In the example embodiment, identifying primary relational columns knownto contain values corresponding relational inventory metadata 822includes retrieving a relational schematic of inventory database system812. The relational schematic includes data associating inventoryrecords 814 within inventory database system 812 based upon uniquemetadata. In other words, in the example embodiment, potentialrelationships between inventory records 814 are identified in arelational schematic which shows all possible links between inventoryrecords 814. For example, in computer architecture 700 the relationalschematic would indicate that inventory records 814 for applicationservers 736 (shown in FIG. 7) may include columns referring to firstlistener 742, first application 746, and first datasource 752. Therelational schematic may be generated manually, using an automatedprogram, or a combination thereof. In the example embodiment, anautomated program may be used to scan all inventory records 814 frominventory database system 812 to determine the presence of relationalinventory metadata 822.

Controller server 112 next determines 830 that relational inventorymetadata 822 indicates at least one related inventory record 832. Atleast one related inventory record 832 refers to all inventory records814 referenced by relational metadata 822. In the above example,inventory record 814 for first web server 712 will contain relationalmetadata 822 indicating that there is a related inventory record 832associated with first web system 706. As indicated, more than onerelated inventory record 832 may be determined.

In the example embodiment, controller server 112 determines 830 thatrelational inventory metadata 822 indicates at least one relatedinventory record 832 by retrieving primary values from primaryrelational columns, identifying potentially related inventory records832, stored as potentially related tables, from inventory databasesystem 812, identifying secondary relational columns known to containvalues corresponding to primary relational columns, and comparing valuesof secondary relational columns to primary values. In the above example,retrieving first primary values from primary relational columnsrepresents retrieving values for “Host” from inventory record 814associated with a scan of first web server 712. Retrieving these valuesresults in retrieving “First Web System 706.” Identifying potentiallyrelated inventory records 832 represents identifying all web systemswhich may have a value corresponding to “First Web System 706.” Incomputer architecture 700, this represents identifying first web system706 and second web system 708. Identifying secondary relational columnsknown to contain values corresponding to primary relational columnsrepresents finding host data in columns in inventory records 814corresponding to first web system 706 and second web system 708. Forexample, inventory record 814 for first web system 706 may include:

-   -   Build Version: Build 7.1    -   CPU Speed: 3200    -   Host: WebSys1

Alternately, inventory record 814 for second web system 708 may include:

-   -   Build Version: Build 7.1    -   CPU Speed: 3200    -   Host: WebSys1

Therefore, the secondary relational column (the value for “Host” frominventory records 814 associated with first web system 706 and secondweb system 708) may correspond to primary relational columns present ininventory records 814 associated with first web server 712 (the valuefor “Host”). Comparing the values of “Host” from first web system 706and second web system 708 to the value of “Host” for first web server712 will indicate that first web server 712 is related to first websystem 706 but not to second web system 708.

When at least one related inventory record 832 is determined to beindicated, controller server 112 performs 840 first step 810 (retrievingan inventory record 814) from an inventory database system 812 using atleast one related inventory record 832. In other words, controllerserver 112 “crawls” from inventory record 814 to at least one relatedinventory record 832. When more than one related inventory record 832 isdetermined 830, controller server 112 will perform 840 first step 810using each related inventory record 832. As a result, controller server112 will continue to crawl until no connections can be determined froman initial inventory record 814.

In the example embodiment, controller server 112 will identifyrelationships between inventory record 814 and at least one relatedinventory record 832 and write identified relationships to a relationaloutput record. The relational output record is representative of dataexplaining the relationship between inventory record 814 and all relatedinventory records 832. The relational output record may take any formsuitable for describing such relationships including, withoutlimitation, a hierarchical database representing the identifiedrelationships, a flat-file representing the identified relationships,and a chart representing the identified relationships.

This written description uses examples to disclose the invention,including the best mode, and also to enable any person skilled in theart to practice the invention, including making and using any devices orsystems and performing any incorporated methods. The patentable scope ofthe invention is defined by the claims, and may include other examplesthat occur to those skilled in the art. Such other examples are intendedto be within the scope of the claims if they have structural elementsthat do not differ from the literal language of the claims, or if theyinclude equivalent structural elements with insubstantial differencesfrom the literal languages of the claims.

What is claimed is:
 1. A computer-implemented method for identifyingrelationships in computer infrastructure inventory data, the methodimplemented by a computing device coupled to a memory device, the methodcomprising: (a) using the computing device to retrieve an inventoryrecord from an inventory database system, the inventory recordcontaining inventory data representing a first set of computerinfrastructure inventory components generated from a first scan of afirst host system, the first set of computer infrastructure inventorycomponents including at least one hardware component; (b) determiningthat the inventory data contains relational inventory metadata by:identifying primary relational columns known to contain valuescorresponding to relational inventory metadata; and determining ifprimary relational columns have non-null values; (c) determining thatthe relational inventory metadata indicates at least one relatedinventory record including a second set of inventory data representing asecond set of computer infrastructure inventory components generatedfrom a second scan of a second host system, the second set of computerinfrastructure inventory components including at least one hardwarecomponent, by: retrieving primary values from primary relationalcolumns; identifying potentially related inventory records, stored aspotentially related tables, from the inventory database system;identifying secondary relational columns known to contain valuescorresponding to primary relational columns; and comparing values ofsecondary relational columns to primary values; and (d) performing step(a) by retrieving the at least one related inventory record as theinventory record.
 2. The method of claim 1, further comprising:identifying relationships between the inventory record and the at leastone related inventory record; and writing identified relationships to arelational output record describing a relationship between the inventoryrecord and the at least one related inventory record and a relationshipbetween the first host system and the second host system.
 3. The methodof claim 2, wherein writing identified relationships further compriseswriting identified relationships to a relational output record whereinthe relational output record is at least one of: a hierarchical databaserepresenting the identified relationships; a flat-file representing theidentified relationships; and a chart representing the identifiedrelationships.
 4. The method of claim 1, wherein using the computingdevice to retrieve an inventory record further comprises using thecomputing device to retrieve an inventory record that is stored as aunique row or a unique record in the inventory database system.
 5. Themethod of claim 1, wherein identifying primary relational columns knownto contain values corresponding to relational inventory metadatacomprises: retrieving a relational schematic of the inventory databasesystem, the relational schematic including data associating inventoryrecords within the inventory database system based upon unique metadata.6. The method of claim 1, wherein identifying potentially relatedinventory records from the inventory database system further comprises:scanning all inventory records in the inventory database system for thepresence of relational inventory metadata.
 7. A computer system foridentifying relationships in computer infrastructure inventory datacomprising: a processor; a memory device coupled to said processor; andan inventory database system stored on said memory device, saidinventory database system including computer-executable instructionsallowing the computer to manage stored records, said processorconfigured to: (a) retrieve an inventory record from the inventorydatabase system, the inventory record containing inventory datarepresenting a first set of computer infrastructure inventory componentsgenerated from a first scan of a first host system, the first set ofcomputer infrastructure inventory components including at least onehardware component; (b) determine that the inventory data containsrelational inventory metadata by identification of primary relationalcolumns known to contain values corresponding to relational inventorymetadata and determination of primary relational columns having non-nullvalues; (c) determine that the relational inventory metadata indicatesat least one related inventory record including a second set ofinventory data representing a second set of computer infrastructureinventory components generated from a second scan of a second hostsystem by retrieval of primary values from primary relational columns,identification of potentially related inventory records, stored aspotentially related tables, from the inventory database system, furtheridentification of secondary relational columns known to contain valuescorresponding to primary relational columns, and comparison of values ofsecondary relational columns to primary values, the second set ofcomputer infrastructure inventory components including at least onehardware component; and (d) perform step (a) by retrieving the at leastone related inventory record.
 8. A computer system in accordance withclaim 7 wherein the processor is further configured to: identifyrelationships between the inventory record and the at least one relatedinventory record; and write identified relationships to a relationaloutput record describing a relationship between the inventory record andthe at least one related inventory record and a relationship between thefirst host system and the second host system.
 9. A computer system inaccordance with claim 8, wherein the relational output record is atleast one of: a hierarchical database representing the identifiedrelationships; a flat-file representing the identified relationships;and a chart representing the identified relationships.
 10. A computersystem in accordance with claim 7, wherein the inventory record isstored as a unique row or a unique record in the inventory databasesystem.
 11. A computer system in accordance with claim 7 wherein theprocessor is further configured to retrieve a relational schematic ofthe inventory database system, the relational schematic including dataassociating inventory records within the inventory database system basedupon unique metadata.
 12. A computer system in accordance with claim 7,wherein the processor is further configured to scan all inventoryrecords in the inventory database system for the presence of relationalinventory metadata.
 13. Non-transitory computer-readable storage mediafor identifying relationships in computer infrastructure inventory datahaving computer-executable instructions embodied thereon, wherein, whenexecuted by at least one processor, the computer-executable instructionscause the processor to: (a) retrieve an inventory record from aninventory database system, the inventory record containing inventorydata representing a first set of computer infrastructure inventorycomponents generated from a first scan of a first host system, the firstset of computer infrastructure inventory components including at leastone hardware component; (b) determine that the inventory data containsrelational inventory metadata by identification of primary relationalcolumns known to contain values corresponding to relational inventorymetadata and determination of primary relational columns having non-nullvalues; (c) determine that the relational inventory metadata indicatesat least one related inventory record including a second set ofinventory data representing a second set of computer infrastructureinventory components generated from a second scan of a second hostsystem by retrieval of primary values from primary relational columns,identification of potentially related inventory records, stored aspotentially related tables, from the inventory database system, furtheridentification of secondary relational columns known to contain valuescorresponding to primary relational columns, and comparison of values ofsecondary relational columns to primary values, the second set ofcomputer infrastructure inventory components including at least onehardware component; and (d) perform step (a) by retrieving the at leastone related inventory record.
 14. The non-transitory computer-readablestorage media in accordance with claim 13 wherein thecomputer-executable instructions further cause the processor to:identify relationships between the inventory record and the at least onerelated inventory record; and write identified relationships to arelational output record describing a relationship between the inventoryrecord and the at least one related inventory record and a relationshipbetween the first host system and the second host system.
 15. Thenon-transitory computer-readable storage media in accordance with claim14, wherein the relational output record is at least one of: ahierarchical database representing the identified relationships; aflat-file representing the identified relationships; and a chartrepresenting the identified relationships.
 16. The non-transitorycomputer-readable storage media in accordance with claim 13, wherein theinventory record is stored as a unique row or a unique record in theinventory database system.
 17. The non-transitory computer-readablestorage media in accordance with claim 13 further configured to retrievea relational schematic of the inventory database system, the relationalschematic including data associating inventory records within theinventory database system based upon unique metadata.
 18. Thenon-transitory computer-readable storage media in accordance with claim13, further configured to scan all inventory records in the inventorydatabase system for the presence of relational inventory metadata.